The Segmentation and Classification of Story Boundaries in News Video
نویسندگان
چکیده
The segmentation and classification of news video into single-story semantic units is a challenging problem. This research proposes a two-level, multi-modal framework to tackle this problem. The video is analyzed at the shot and story unit (or scene) levels using a variety of features and techniques. At the shot level, we employ a Decision Tree to classify the shot into one of 13 pre-defined categories. At the scene level, we perform the HMM (Hidden Markov Models) analysis to eliminate shot classification errors and to locate story boundaries. We test the performance of our system using two days of news video obtained from the MediaCorp of Singapore. Our initial results indicate that we could achieve a high accuracy of over 95 % for shot classification. The use of HMM analysis helps to improve the accuracy of the shot classification and achieve over 89% accuracy on story segmentation.
منابع مشابه
Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing
News story parsing is an important and challenging task in a news video library system. In this paper, we address two important components in a news video story parsing system: shot boundary detection and anchorperson detection. First, an unsupervised fuzzy -means algorithm is used to detect video-shot boundaries in order to segment a news video into video shots. Then, a graph-theoretical clust...
متن کاملFeature Selection for Trainable Multilingual Broadcast News Segmentation
Indexing and retrieving broadcast news stories within a large collection requires automatic detection of story boundaries. This video news story segmentation can use a wide range of audio, language, video, and image features. In this paper, we investigate the correlation between automatically-derived multimodal features and story boundaries in seven different broadcast news sources in three lan...
متن کاملAutomatic Story Segmentation for Spoken Document Retrieval
We have been working on speech retrieval based on Cantonese television news programs. Our video archive contains over 20 hours of news programs provided by a local television station. These programs have been hand-segmented into video clips, where each clip is a self-contained news story. The audio tracks in our archive are indexed by Cantonese speech recognition. This is integrated with a vect...
متن کاملUnsupervised and Model-Free News Video Segmentation
Based on a simple temporal structural model of news program, this paper presents a practical solution to automatic news story segmentation by integrating syntactic and semantic methods. First, a syntactic segmentation method is used to detect the shot boundaries in order to partition video frames into video shots. Then a semantic segmentation method based on the graph-theoretical cluster analys...
متن کاملDiscovery and Fusion of Salient Multi-modal Features towards News Story Segmentation
In this paper, we present our new results in news video story segmentation and classification in the context of TRECVID video retrieval benchmarking event 2003. We applied and extended the Maximum Entropy statistical model to effectively fuse diverse features from multiple levels and modalities, including visual, audio, and text. We have included various features such as motion, face, music/spe...
متن کامل